EM_ESTIMATE(μ_1, ..., μ_K, Σ_1, ..., Σ_K, τ_1, ..., τ_K, A')
{ FOR EACH ROW R OF A WITH MISSING VALUES
  { FOR i = 1, ..., K
    { USE EM AND N(μ_i, Σ_i) TO ESTIMATE THE
      MISSING VALUES IN R.
      R_i ← R WITH MISSING VALUES REPLACED BY ESTIMATES.
    }
    R' ← WEIGHTED AVERAGE(R_1, ..., R_K).
    REPLACE R IN A' BY R'.
  }
  RETURN A'.
}
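The row-level estimate in the EM estimate routine above can be sketched in Python. This is a minimal illustration and not the claimed implementation: it assumes that "estimating the missing values" under one Gaussian component means taking the conditional mean of the missing entries given the observed ones, and it takes the averaging weights as an argument because the pseudocode does not state how they are chosen. The names `solve`, `conditional_fill` and `em_estimate_row` are hypothetical.

```python
def solve(M, b):
    """Solve M x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [list(M[i]) + [b[i]] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            f = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= f * aug[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - s) / aug[i][i]
    return x


def conditional_fill(row, mu, sigma):
    """Replace None entries of row by their conditional mean under N(mu, sigma).

    Assumption: conditional mean mu_m + Sigma_mo Sigma_oo^{-1} (x_o - mu_o).
    """
    obs = [j for j, v in enumerate(row) if v is not None]
    mis = [j for j, v in enumerate(row) if v is None]
    filled = list(row)
    if not mis:
        return filled
    if not obs:
        # Nothing observed: fall back to the component mean.
        for j in mis:
            filled[j] = mu[j]
        return filled
    sigma_oo = [[sigma[a][c] for c in obs] for a in obs]
    resid = [row[j] - mu[j] for j in obs]
    z = solve(sigma_oo, resid)  # Sigma_oo^{-1} (x_o - mu_o)
    for j in mis:
        filled[j] = mu[j] + sum(sigma[j][o] * z[k] for k, o in enumerate(obs))
    return filled


def em_estimate_row(row, mus, sigmas, weights):
    """Combine the K per-component fills R_1..R_K into one row R'."""
    fills = [conditional_fill(row, m, s) for m, s in zip(mus, sigmas)]
    total = sum(weights)
    return [sum(w * f[j] for w, f in zip(weights, fills)) / total
            for j in range(len(row))]
```

Observed entries pass through unchanged because every component fill keeps them as they are; only the entries marked `None` differ between components and are therefore affected by the weights.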

K_ESTIMATE(K, A)
{ /* FIRST PART: INITIALIZATION */
  B ← ROWS OF A WITHOUT MISSING VALUES.
  μ_1, ..., μ_K, Σ_1, ..., Σ_K, τ_1, ..., τ_K ←
      GAUSSIAN MIXTURE CLUSTERING OF B.
  A' ← EM_ESTIMATE(μ_1, ..., μ_K, Σ_1, ..., Σ_K, τ_1, ..., τ_K, A).
  /* SECOND PART: ITERATION */
  REPEAT
  { μ_1, ..., μ_K, Σ_1, ..., Σ_K, τ_1, ..., τ_K ←
        GAUSSIAN MIXTURE CLUSTERING OF A'.
    A' ← EM_ESTIMATE(μ_1, ..., μ_K, Σ_1, ..., Σ_K, τ_1, ..., τ_K, A').
  } UNTIL CONVERGENCE
  RETURN A'.
}
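The initialize-then-iterate control flow of the K estimate routine above can be sketched as follows. The clustering and estimation steps are passed in as callables (`fit` and `estimate`, both hypothetical names) so that only the loop structure is shown. The convergence test is an assumption, since the pseudocode says only "until convergence": here the loop stops when no entry of the filled matrix changes by more than `tol`.

```python
def k_estimate(A, fit, estimate, tol=1e-6, max_iter=100):
    """Sketch of the two-part loop; None marks a missing value in A.

    fit(rows)           -> model parameters (stand-in for mixture clustering)
    estimate(params, A) -> copy of A with every None replaced by an estimate
    """
    # First part: fit on the complete rows only, then fill the gaps once.
    complete = [r for r in A if all(v is not None for v in r)]
    params = fit(complete)
    current = estimate(params, A)

    # Second part: refit on the filled matrix and re-estimate until stable.
    for _ in range(max_iter):
        params = fit(current)
        updated = estimate(params, A)
        change = max(abs(u - c)
                     for row_u, row_c in zip(updated, current)
                     for u, c in zip(row_u, row_c))
        current = updated
        if change < tol:  # assumed convergence criterion
            break
    return current


# Toy stand-ins (column means) used only to exercise the loop.
def column_means(rows):
    return [sum(col) / len(col) for col in zip(*rows)]


def fill_with_means(means, A):
    return [[means[j] if v is None else v for j, v in enumerate(row)]
            for row in A]


filled = k_estimate([[1.0, 2.0], [3.0, 4.0], [5.0, None]],
                    column_means, fill_with_means)
```

The stand-ins are deliberately trivial; in the method described here, `fit` would be Gaussian mixture clustering with K components and `estimate` would be the EM estimate step.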

GMCimpute(S, A)
{ FOR k = 1, 2, ..., S
  { A_k ← K_ESTIMATE(k, A).
  }
  RETURN (A_1 + A_2 + ... + A_S) / S.
}
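The final averaging step of GMCimpute can be illustrated with a short sketch. It assumes each call for k = 1, ..., S returns a fully filled matrix of the same shape as A; `k_estimate_fn` is a hypothetical callable standing in for the K estimate routine, and the stub used in the example is not a real estimator.

```python
def gmc_impute(A, S, k_estimate_fn):
    """Average the S completed matrices A_1..A_S element by element."""
    estimates = [k_estimate_fn(k, A) for k in range(1, S + 1)]
    n_rows, n_cols = len(A), len(A[0])
    return [[sum(est[i][j] for est in estimates) / S
             for j in range(n_cols)]
            for i in range(n_rows)]


# Stub estimator for illustration only: fills each gap with the value k.
def stub_estimate(k, A):
    return [[float(k) if v is None else v for v in row] for row in A]


averaged = gmc_impute([[1.0, None]], 3, stub_estimate)
```

Because every A_k agrees on the observed entries, averaging leaves them unchanged and only blends the S different estimates of each missing entry.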



(57) Abstract: Clustering is routinely applied in the exploratory 
analysis of microarray data. Missing entries arise from blemishes 
on the microarrays. The present invention provides a new method,
and a computer program and/or computer product thereof, to impute
missing values. The method involves the steps of clustering 
microarray data by partitioning the data into a select number of 
clusters, wherein each data point is iteratively moved from one 
cluster to another, until two consecutive iterations have resulted in 
the same partition pattern; obtaining a select number of estimates of 
the data in the clusters by probabilistic inference; and averaging
the select number of estimates to obtain missing values in the 
microarray data. The method is superior to other imputation models 
as measured by root mean squared errors.
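The abstract names root mean squared error as the comparison measure without giving the evaluation protocol. A minimal sketch, assuming the usual setup in which known entries are hidden, imputed, and then compared against their true values, is given below; `rmse_on_missing` and the `mask` argument are hypothetical names.

```python
import math


def rmse_on_missing(truth, imputed, mask):
    """Root mean squared error over the entries where mask is True."""
    squared = [(t - p) ** 2
               for row_t, row_p, row_m in zip(truth, imputed, mask)
               for t, p, m in zip(row_t, row_p, row_m)
               if m]
    return math.sqrt(sum(squared) / len(squared))


truth = [[1.0, 2.0], [3.0, 4.0]]
imputed = [[1.0, 2.5], [3.5, 4.0]]
mask = [[False, True], [True, False]]  # True marks an entry that was hidden
error = rmse_on_missing(truth, imputed, mask)
```

Restricting the sum to the masked entries keeps the observed values, which every method reproduces exactly, from diluting the error.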